(19) 



(12) 



Europaisches Patentamt 
European Patent Office 

Office feuropeen des brevets (1 1 ) EP 1 036 845 A2 

EUROPEAN PATENT APPLICATION 

(51) Intel. 7 : C12Q 1/68, G06F 19/00 



(43) Date of publication: 

20.09.2000 Bulletin 2000/38 

(21) Application number: 00102262.3 

(22) Date of filing: 15.02.2000 



® 



(84) Designated Contracting States: 


• Truong, Vivi 


AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 


San Jose,CA 95120 (US) 


MC NL PT SE 


• Haber, Daniel 


Designated Extension States: 


Chestnut Hill, MA 02467 (US) 


AL LT LV MK RO SI 


• Lee, Sean 




Maiden, MA 02148 (US) 


(30) Priority: 24.02.1999 US 256301 




(74) Representative: 


(71) Applicant: Affymetrix, Inc. 


UEXKULL & STOLBERG 


Santa Clara, CA 95051 (US) 


Patentanwalte 




Beselerstrasse 4 


(72) Inventors: j 


22607 Hamburg (DE) 


• Oliner, Jonathan 




Mountain View, CA 94043 (US) | 





(54) Downstream genes of tumor suppressor WT1 

(57) Methods are provided for diagnosing cancers, 
drug-screening, and functionally analyzing mutations 
involving the WT1 gene. The methods involve use of the 
newly identified set of genes which are regulated by 
WT1 as well as by the set of genes which are regulated 
by WT1 fusions to EWS. Monitoring expression levels of 
these sets of genes can be used as an indicator of the 
genetic status of the gene. It can also identify which 
have similar effects on down-stream genes. 
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Description 

BACKGROUND OF THE INVENTION 

5 [0001] Many biological functions are accomplished by altering the expression of various genes through transcrip- 
tional (e.g. through control of initiation, provision of RNA precursors, RNA processing, etc.) and/or translational control. 
For example, fundamental biological processes such as cell cycle regulation, cell differentiation and cell death, are 
often characterized by the variations in the expression levels of groups of genes. 

[0002] Gene expression is also associated with pathogenesis. For example, the lack of sufficient expression of 
w functional tumor suppressor genes and/or the over expression of oncogene/protooncogenes could lead to tumorigene- 
sis (Marshall, Cell, 64: 313-326 (1991); Weinberg, Science, 254: 1138-1146 (1991), incorporated herein by reference 
for all purposes). Thus, changes in the expression levels of particular genes (e.g. oncogenes or tumor suppressors) 
serve as signposts for the presence and progression of various diseases. There is a need in the art for discovering 
which genes are affected by particular tumor suppressors and oncogenes, so that they can be used diagnostically as 
15 well as in the search for new therapeutics. 

SUMMARY OF THE INVENTION 

[0003] It is an object of the invention to provide methods for identifying a functional mutation in a WT1 gene. 
20 [0004] It is another object of the invention to provide methods for identifying a EWS-WT1 gene fusion. 

[0005] It is still another object of the invention to provide an in-cell assay to test the biological effect of a WT1 muta- 
tion. 

[0006] It is yet another object of the invention to provide methods to detect a WT1 mutation or a EWS-WT1 fusion 
using a computer. 

25 [0007] It is a further object of the invention to provide methods to diagnose neoplasia. 

[0008] It is another object of the invention to provide methods for identifying drugs useful for treating neoplasias. 
[0009] These and other objects of the invention are provided by one or more of the following embodiments. In one 
embodiment a method is provided for detecting a WT1 gene functional mutation in target cells. Expression of at least 
three down-stream genes of WT1 is detected in a sample of (a) target cells, and (b) reference cells having a wild-type 

30 WT1 gene. The reference cells are otherwise substantially similar to the target cells. The down-stream genes are up- 
or down-regulated by the wild-type WT1 gene. The expression of the down-stream genes in the target cells and the ref- 
erence cells is compared; a difference in the expression between the target cells and reference cells suggests a WT1 
functional mutation in the target cells. 

[0010] According to yet another embodiment a method is provided for detecting a EWS-WT1 gene fusion in target 

35 cells. Expression of one or more down-stream genes of an EWS-WT1 fusion is detected in a sample of(a) target cells, 
and (b) reference cells having a wild-type EWS and WT1 gene. The reference cells are otherwise substantially similar 
to the target cells. The down-stream genes are up- or down-regulated by the wild-type EWS and WT1 gene. The 
expression of the down-stream genes in the target cells and the reference cells is compared. A difference in the expres- 
sion between the target cells and reference cells suggests a EWS-WT1 fusion in the target cells. 

40 [001 1 ] Another aspect of the invention is an in-cell functional assay for a WT1 sequence alteration. The expression 
is detected of at least three down-stream genes in a target sample from target cells having a WT1 sequence alteration 
and in a reference sample from reference cells having a wild-type WT1 gene. The reference cells are otherwise sub- 
stantially similar to the target cells. The down-stream genes are up- or down-regulated by the wild-type WT1 gene. The 
expression in the target sample is compared to the expression in the reference sample. A difference in the expression 

45 between the two samples suggests that the WT1 sequence alteration affects the biological function of WT1 . 

[0012] According to another aspect of the invention a method is provided for detecting a mutation in a target WT1 
gene using a computer. Wild-type expression data of at least three down-stream genes in a wild-type sample containing 
a wild-type WT1 gene is input into a computer. The down-stream genes are transcriptionally regulated by the wild-type 
WT1 gene. Target expression data of the plurality of down-stream genes in a target sample containing the target WT1 

so gene is also input into a computer. The target and wild-type expression data are compared to determine differences. 
Differences suggest a mutation in the target WT1 gene. 

[0013] In still another embodiment a method is provided for detecting a translocation fusing EWS and WT1 genes 
using a computer. Wild-type. expression data of a plurality of down-stream genes in a w ild-t ype sample containing wild- 
type EWS and WT1 genes is input into a computer. The down-stream genes are transcriptionally regulated by a EWS- 
55 WT1 fusion protein. Target expression data of the plurality of down-stream genes in a target sample which is being 
tested for the presence of a EWS-WT1 fusion protein is also input into a computer. The target and wild-type expression 
data are compared to determine differences. Such differences suggest a translocation fusing the EWS and WT1 genes. 
[0014] Another embodiment of the invention provides a method of diagnosing neoplasia of a test cell. A transcrip- 
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tion indicator of a test cell is hybridized to a set of nucleic acid probes. The transcription indicator is selected from the 
group consisting of mRNA, cDNA and cRNA. The set of nucleic acid probes comprises a plurality of nucleic acid mole- 
cules each of which is a portion of a gene which is activated by or repressed by WT1 selected from the group consisting 
of: natural killer cells protein 4 precursor (M59807), heat shock protein HSP70B (X51 758), 90K product (H1 7969), heat 

5 shock 70kd protein 1 (T66307), procollagen alpha 1 (T51558), cysteine protease inhibitor from radiated keratinocytes 
(X05978), brain natriuretic protein (BNP) (M31766), leukemia virus receptor 1 (GLVR1) (L20859), type 1 cytoskeletal 
17 keratin (R71870), purine nucleoside phosphorylase (T47964), adrenomedullin (D14874), gravin (M96322), jun-B 
(X51345), elongation factor 1 alpha-2 (X79490), homeotic gene regulator (R16977), clone 9112 (X57348), interferon 
gamma treatment inducible mRNA (M26683), transmembrane receptor protein (Z17227) and mitochondrial phosphate 

10 carrier protein (R49231). Amounts of transcription indicator which hybridize to each of the set of nucleic acid probes are 
detected. A test cell is identified as neoplastic if (1) hybridization of the transcription indicator of the test cell to a probe 
which is a WT1 -activated gene is lower than hybridization using a transcription indicator from a normal cell, or (2) 
hybridization of the transcription indicator of the test cell to a probe which a WT1 -repressed gene is higher than hybrid- 
ization using a transcription indicator from a normal cell. 

15 [0015] A further aspect of the invention is provided by a method of identifying anti-cancer drugs. A test compound 
is contacted with a human cell. The effect of th.e test compound on the expression by the cell of at least one WT1 up- 
or down-regulated gene is determined. The regulated gene is selected from the group consisting of: natural killer cells 
protein 4 precursor (M59807), heat shock protein HSP70B (X51758), 90K product (H17969), heat shock 70kd protein 
1 (T66307), procollagen alpha 1 (T51558), cysteine protease inhibitor from radiated keratinocytes (X05978), brain natri- 

20 uretic protein (BNP) (M31766), leukemia virus receptor 1 (GLVR1) (L20859), type 1 cytoskeletal 17 keratin, purine nucl- 
eoside phosphorylase (T47964), adrenomedullin (D14874), gravin (M96322), jun-B (X51345), elongation factor 1 
alpha-2 (X79490), homeotic gene regulator (R16977), clone 9112 (X57348), interferon gamma treatment inducible 
mRNA (M26683), transmembrane receptor protein (Z17227) and mitochondrial phosphate carrier protein (R49231). A 
test compound is identified as a potential anti-cancer drug if it increases expression of at least one WT1 up-regulated 

25 gene or decreases expression of a WT1 down-regulated gene in the human cell. 

[0016] Another aspect of the invention is a method of diagnosing neoplasia of a test cell. A transcription indicator 
of a test cell is hybridized to a set of nucleic acid probes. The transcription indicator is selected from the group consist- 
ing of mRNA, cDNA and cRNA. The set of nucleic acid probes comprises a plurality of nucleic acid molecules each of 
which is a portion of a gene which is activated by or repressed by a EWS-WT1 fusion protein. Amounts of transcription 

30 indicator which hybridize to each of the set of nucleic acid probes are detected. A test cell is identified as neoplastic if 
(1) hybridization of the transcription indicator of the test cell to a probe which is a EWS-WT1 -activated gene is lower 
than hybridization using a transcription indicator from a normal cell, or (2) hybridization of the transcription indicator of 
the test cell to a probe which a EWS-WT1 -repressed gene is higher than hybridization using a transcription indicator 
from a normal cell. 

35 [001 7] According to still another embodiment of the invention a method of identifying anti-cancer drugs is provided. 
A test compound is contacted with a human cell. Expression of an EWS-WT1 up-regulated gene or a EWS-WT1 down- 
regulated gene is determined. A test compound is identified as a potential anti-cancer drug if it decreases expression 
of a EWS-WT1 up-regulated genes or increases expression of a EWS-WT1 down-regulated gene in the human cell. 
[001 8] A method is also provided for detecting a WT1 gene functional mutation in target cells. Expression of at least 

40 one down-stream gene of WT1 is detected in a sample of (a) target cells, and (b) reference cells having a wild-type WT1 
gene. The reference cells are otherwise substantially similar to the target cells. The down-stream genes are selected 
from the group consisting of: natural killer cells protein 4 precursor (M59807), folate binding protein (M25317), 
HALPHA44 gene for alpha-tubulin (X06956), heat shock protein HSP70B (X51 758), 90K product (H17969), heat shock 
70kd protein 1 (T66307), amphiregulin (M30704), procollagen alpha 1 (T51558), beta-migrating plasminogen activator 

45 inhibitor 1 (M14083), cysteine protease inhibitor from radiated keratinocytes (X05978), brain natriuretic protein (BNP) 
(M31 766), leukemia virus receptor 1 (GLVR1 ) (L20859), type 1 cytoskeletal 1 7 keratin, tubulin alpha-5 chain (H45051 ) t 
purine nucleoside phosphorylase (T47964), Gem GTPase (U10550), adrenomedullin (D14874), GPI-anchored uroki- 
nase plasminogen activator surface receptor (R38636), tissue type plasminogen activator (X13097), gravin (M96322), 
jun-B (X51345), elongation factor 1 alpha-2 (X79490), homeotic gene regulator (R16977), clone 91 12 (X57348), inter- 

so feron gamma treatment inducible mRNA (M26683), transmembrane receptor protein (Z17227) mitochondrial phos- 
phate carrier protein (R49231), and caveolin. The expression of the at least one down-stream gene in the target cells 
and the reference cells is compared. A difference in the expression between the target cells and reference cells sug- 
gests a WT1 functional mutation in the target cells. 

[001 9] An in-cell functional assay for a WT1 sequence alteration is also provided by the present invention. Expres- 
55 sion is detected in a target sample from target cells having a WT1 sequence alteration and in a reference sample from 
reference cells having a wild-type WT1 gene of at least one down-stream gene selected from the group consisting of 
natural killer cells protein 4 precursor (M59807), folate binding protein (M25317), HALPHA44 gene for alpha-tubulin 
(X06956), heat shock' protein HSP70B (X51758), 90K product (H17969), heat shock 70kd protein 1 (T66307), 
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amphiregulin (M30704), procollagen alpha 1 (T51558), beta-migrating plasminogen activator inhibitor 1 (M14083), 
cysteine protease inhibitor from radiated keratinocytes (X05978), brain natriuretic protein (BNP) (M31766), leukemia 
virus receptor 1 (GLVR1) (L20859), type 1 cytoskeletal 17 keratin, tubulin alpha-5 chain (H45051), purine nucleoside 
phosphorylase (T47964), Gem GTPase (U10550), adrenomedullin (D14874), GPI-anchored urokinase plasminogen 

5 activator surface receptor (R38636), tissue type plasminogen activator (X13097), gravin (M96322), jun-B (X51345), 
elongation factor 1 alpha-2 (X79490), homeotic gene regulator (R16977), clone 9112 (X57348), interferon gamma 
treatment inducible mRNA (M26683), transmembrane receptor protein (Z17227), caveolin and mitochondrial phos- 
phate carrier protein (R49231). The reference cells are otherwise substantially similar to the target cells. The down- 
stream genes are up- or down-regulated by the wild-type WT1 gene. Expression in the target sample is compared to 

io the expression in the reference sample. A difference in the expression between the two samples suggests that the WT1 
sequence alteration affects the biological function of WT1 . 

[0020] Another embodiment of the invention provides a method for detecting a mutation in a target WT1 gene using 
a computer. Wild-type expression data of at least one WT1 -down-stream gene in a wild-type sample containing a wild- 
type WT1 gene is input into a computer. The down-stream gene is selected from the group consisting of: natural killer 

15 cells protein 4 precursor (M59807), folate binding protein (M25317), HALPHA44 gene for alpha-tubulin (X06956), heat 
shock protein HSP70B (X51758), 90K product (H17969), heat shock 70kd protein 1 (T66307), amphiregulin (M30704), 
procollagen alpha 1 (T51558), beta-migrating plasminogen activator inhibitor 1 (M14083), cysteine protease inhibitor 
from radiated keratinocytes (X05978), brain natriuretic protein (BNP) (M31766), leukemia virus receptor 1 (GLVR1) 
(L20859), type 1 cytoskeletal 17 keratin, tubulin alpha-5 chain (H45051), purine nucleoside phosphorylase (T47964), 

20 Gem GTPase (U1 0550),. adrenomedullin (D14874), GPI-anchored urokinase plasminogen activator surface receptor 
(R38636), tissue type plasminogen activator (X13097), gravin (M96322), jun-B (X51345), elongation factor 1 alpha-2 
(X79490), homeotic gene regulator (R16977), clone 9112 (X57348), interferon gamma treatment inducible mRNA 
(M26683), transmembrane receptor protein (Z17227), caveolin, and mitochondrial phosphate carrier protein (R49231). 
Target expression data of the plurality of down-stream genes in a target sample containing the target WT1 gene is also 

25 input into a computer. The target and wild-type expression data are compared to determine differences. Such differ- 
ences suggest a mutation in the target WT1 gene. 

[0021] According to yet another aspect of the invention a method of diagnosing neoplasia of a test cell is provided. 
Expression levels are determined in a test cell and in a control cell of at least one gene which is activated by or 
repressed by WT1. The genes are selected from the group consisting of: natural killer cells protein 4 precursor 

30 (M59807), heat shock protein HSP70B (X51758), 90K product (H17969), heat shock 70kd protein 1 (T66307), procol- 
lagen alpha 1 (T51558), cysteine protease inhibitor from radiated keratinocytes (X05978), brain natriuretic protein 
(BNP) (M31766), leukemia virus receptor 1 (GLVR1) (L20859), type 1 cytoskeletal 17 keratin (R71870), purine nucleo- 
side phosphorylase (T47964), adrenomedullin (D14874), gravin (M96322), jun-B (X51345), elongation factor 1 alpha- 
2 (X79490), homeotic gene regulator (R16977), clone 9112 (X57348), interferon gamma treatment inducible mRNA 

35 (M26683), transmembrane receptor protein (Z17227) and mitochondrial phosphate carrier protein (R49231). The 
determined expression levels in the test cell is compared to the determined expression levels in the control cell. The test 
cell is identified as neoplastic if (1) the level of expression of at least one of the genes which is a WT1 -activated gene 
is lower in the test cell than in the control cell, or (2) the level of expression of at least one of the genes which is WT1- 
repressed is higher in the test cell than in the control cell. 

40 [0022] Another aspect of the invention provides a method of diagnosing neoplasia of a test cell. Expression levels 
in a test cell and in a control cell are determined of at least one gene which is activated by or repressed by a EWS-WT1 
fusion protein. The determined expression levels in the test cell are compared to the determined expression levels in 
the control cell. A test cell is identified as neoplastic if (1) expression in the test cell of a EWS-WT1 -activated gene is 
lower than expression in a normal cell, or (2) expression in the test cell of a EWS-WT1 -repressed gene is higher than 

45 expression in the normal cell. 

[0023] Also provided is an additional method of diagnosing neoplasia of a test cell. Expression levels are deter- 
mined in a test cell and in a control cell of at least three genes which are activated by or repressed by WT1 . The deter- 
mined expression levels in the test cell are compared to the determined expression levels in the control cell. A test cell 
is identified as neoplastic if (1) expression in the test cell of a WT1 -activated gene is lower than expression in a normal 

so cell, or (2) expression in the test cell of a WT1 -repressed gene is higher than expression in the normal cell. 

DETAILED DESCRIPTION OF THE INVENTION 

[0024] It is a discovery of the present inventors that WT1 regulates the transcription of a set of genes, the size of 
55 the set being far greater than previously suggested. Sets of genes have been identified which are up-regulated and 
down-regulated by WT1 , the latter set being far smaller. In addition, genes have been identified which are up-regulated 
by the EWS-WT1 gene fusion caused by a translocation. This translocation occurs in desmoplastic round cell tumors, 
such as Ewing's Sarcomas. Identification of these sets of genes permits their use in a variety of diagnostic and analyt- 
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ical methods. It also permits their use to screen compounds for drugs which have the same or similar effects as WT1 
and for drugs which have the opposite effect from the EWS-WT1 gene fusion. 

[0025] Tables 1 a and 1 b show the genes which are regulated by WT1 . Of those genes only caveolin and elongation 
factor 1 alpha-2 are down-regulated by WT1 . The others are up-regulated by WT1 . Table 2 shows a subset of the WT1- 

5 regulated genes which were known to have a previously suggested link to cancer and what that link is. Table 3 shows 
the genes which are regulated by the EWS-WT1 fusion gene. All of these genes are up-regulated by the fusion gene. 
Other genes are undoubtedly regulated by these genes, and they can be identified using similar techniques to those 
described in the example. Alternatively, other techniques such as Serial Analysis of Gene Expression (SAGE) can be 
used to identify regulated genes which may not yet have been identified or made part of the public databases. 

10 [0026] Expression of genes which are regulatorily downstream of WT1 or EWS-WT1 can be monitored by any tech- 
niques known in the art. These include but are not limited to the use of SAGE, hybridization to probes on an array, West- 
ern blot, Northern blot, dot blot, slot blot, ELISA, radioimmunoassay. Any technique which can be used to quantitate a 
gene product, whether mRNA or protein, can be used. The methods of the present invention predominantly analyze 
changes in a test sample as compared to a control sample. Precise quantitation need not be accomplished. In many 

15 cases comparisons will suffice without ever determining an absolute amount of a gene product. 

[0027] Target cells, as used in the present invention are those cells which are being tested. Typically they are of an 
unknown mutational status. In some instances, a mutation is identified in the target cells, but its functional effect is 
unknown. Reference cells are preferably as similar as possible to the target cells, to permit the easiest detection of dif- 
ferences which are due to the WT1 gene or the EWS-WT1 gene. Reference cells typically comprise wild-type WT1 alle- 

20 les. Often the reference cells are taken from the same individual and from the tissue adjacent to a, tumor sample which 
is being tested. 

[0028] Differences between target cells and reference cells may be small or large. Some small differences may be 
very reproducible and therefore useful. For other purposes, large differences may be desirable for ease of detection. 
For some assays it may be useful to set threshold levels of change. For some purposes such threshold levels may be 
25 0.5, 2, 3, 5, 7, or 1 0 fold. This may depend on the technique being used for detection as well as on the number of genes 
which are being tested. One of skill in the art can readily set threshold levels which are preferred in a particular meth- 
odological context. 

[0029] The number of downstream genes analyzed can vary depending, for example, on the magnitude of induced 
changes and their reproducibility. In some instances, larger numbers of genes may be analyzed to more accurately 
30 reflect a WT1 change, since other genes may also affect expression of a particular gene being analyzed. Typically at 
least 1, 2, 3, 4, 5, 7, 10, or 15 downstream genes of WT1 or EWS-WT1 are analyzed. Analysis of a number of genes 
greater than 1 can be accomplished simultaneously, sequentially, or cumulatively. 

[0030] Computers can be used in the present invention to store data, to perform mathematical comparisons, to 
store results. Data can be manually input or may be fed from other machines, such as optical scanners, scintillation 
35 counters, and the like. High through-put screens can thus be efficiently run and analyzed. 

I. Definitions 

[0031] 

40 

Hybridizing specifically to: The phrase "hybridizing specifically to" refers to the binding, duplexing, or hybridizing of 
a molecule substantially to or only to a particular nucleotide sequence or sequences under stringent conditions 
when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA. 

45 mRNA or transcript: The term "mRNA" refers to transcripts of a gene. Transcripts are RNA including, for example, 
mature messenger RNA ready for translation, products of various stages of transcript processing. Transcript 
processing may include splicing, editing and degradation. 

Nucleic Acid: The terms "nucleic acid" or "nucleic acid molecule" refer to a deoxyribonucleotide or ribonucleotide 
so polymer in either single-or double-stranded form, and unless otherwise limited, would encompass analogs of nat- 
ural nucleotide that can function in a similar manner as naturally occurring nucleotide. A n oligonucleotide is a sin- 
gle-stranded nucleic acid of 2 to n bases, where n may be greater than 500 to 1 000. Nucleic acids may be cloned 
or synthesized using any technique known in the art. They may also include non-naturally occurring nucleotide ana- 
logs, such as those which are modified to improve hybridization and peptide nucleic acids. 

55 

Nucleic acid encoding a regulatory molecule: The regulatory molecule may be DNA, RNA or protein. Thus for 
example DNA sites which bind protein or other nucleic acid molecules are included within the class of regulatory 
molecules encoded by a nucleic acid. 
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Probe: As used herein a "probe" is defined as a nucleic acid, capable of binding to a target nucleic acid of comple- 
mentary sequence through one or more types of chemical bonds, usually through complementary base pairing, 
usually through hydrogen bond formation. As used herein, a probe may include natural (i.e. A, G, U, C, or T) or 
modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in probes may be joined by a linkage other 
than a phosphodiester bond, so long as it does not interfere with hybridization. Thus, probes may be peptide 
nucleic acids in which the constituent bases are joined by peptide bonds rather than phosphodiester linkages. 

Target nucleic acid: The term "target nucleic acid" refers to a nucleic acid (often derived from a biological sample), 
to which the probe is designed to specifically hybridize. It is either the presence or absence of the target nucleic 
acid that is to be detected, or the amount of the target nucleic acid that is to be quantified. The target nucleic acid 
has a sequence that is complementary to the nucleic acid sequence of the corresponding probe directed to the tar- 
get. The term target nucleic acid may refer to the specific subsequence of a larger nucleic acid to which the probe 
is directed or to the overall sequence {e.g., gene or mRNA) whose expression level it is desired to detect. The dif- 
ference in usage will be apparent from context. 

Stringent conditions: The term "stringent conditions" refers to conditions under which a probe will hybridize to its 
target subsequence, but with only insubstantial hybridization to other sequences or to other sequences such that 
the difference may be identified. Stringent conditions are sequence-dependent and will be different in different cir- 
cumstances. Longer sequences hybridize specifically at higher temperatures. Generally, stringent conditions are 
selected to be about 5 °C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic 
strength and pH. 

Subsequence: "Subsequence" refers to a sequence of nucleic acids that comprise a part of a longer sequence of 
nucleic acids. 

Thermal melting point (Tm): The Tm is the temperature, under defined ionic strength, pH, and nucleic acid concen- 
tration, at which 50% of the probes complementary to the target sequence hybridize to the target sequence at equi- 
librium. As the target sequences are generally present in excess, at Tm, 50% of the probes are occupied at 
equilibrium). Typically, stringent conditions will be those in which the salt concentration is at least about 0.01 to 1 .0 
M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30 C for short probes 
(e.g., 10 to 50 nucleotide). Stringent conditions may also be achieved with the addition of destabilizing agents such 
as formamide. 

Quantifying: The term "quantifying" or "quantitating" when used in the context of quantifying transcription levels of 
a gene can refer to absolute or to relative quantification. Absolute quantification may be accomplished by inclusion 
of known concentration(s) of one or more target nucleic acids (e.g. control nucleic acids such as Bio B or with 
known amounts the target nucleic acids themselves) and referencing the hybridization intensity of unknowns with 
the known target nucleic acids (e.g. through generation of a standard curve). Alternatively, relative quantification 
can be accomplished by comparison of hybridization signals between two or more genes, or between two or more 
treatments to quantify the changes in hybridization intensity and, by implication, transcription level. 

Up-stream or down-stream gene. If the expression of a first gene is regulated by a second gene, the second gene 
is called an "up-stream gene" for the first gene and the first gene is the "down-stream" gene of the second gene. 
The regulation of the first gene by second gene could be through trans-activation. For example, the first gene 
encodes a transcriptional factor that controls the expression of the second gene. The regulation can also be 
exerted by a cis-acting mechanism. For example, the first gene is in the proximity of the second gene and exerts a 
positional effect on the expression of the second gene. In this case, the first gene does not have to be expressed 
in order to have an influence on the second gene. 

[0032] Activity of a gene is reflected by the activity of its product(s): the proteins or other molecules encoded by the 
gene. Those product molecules perform biological functions. Directly measuring the activity of a gene product is, how- 
ever, often difficult for certain genes. Instead, the immunological activities or the amount of the final product(s) or its 
peptide processing intermediates are determined as a measurement of the gene activity. More frequently, the amount 
or activity of intermediates, such as transcripts, RNA processing intermediates, or mature mRNAs are detected as a 
measurement of gene activity. 

[0033] In many cases, the form and function of the final product(s) of a gene is unknown. In those cases, the activity 
of a gene is measured conveniently by the amount or activity of transcript(s), RNA processing intermediate(s), mature 
mRNA(s) or its protein product(s) or functional activity of its protein product(s). 
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[0034] Any methods that measure the activity of a gene are useful for at least some embodiments of this invention. 
For example, traditional Northern blotting and hybridization, nuclease protection, RT- PCR and differential display have 
been used for detecting gene activity. Those methods are useful for some embodiments of the invention. However, this 
invention is most useful in conjunction with methods for detecting the. expression of a large number of genes. 

5 [0035] High-density arrays are particularly useful for monitoring the expression control at the transcriptional, RNA 
processing and degradation level. The fabrication and application of high-density arrays in gene expression monitoring 
have been disclosed previously in, for example, WO 97/1 0365, WO 92/1 0588, U.S. Application Ser. No. 08/772,376 filed 
December 23, 1996; serial number 08/529,1 15 filed on September 15, 1995; serial number 08/168,904 filed December 
15, 1993; serial number 07/624,1 14 filed on December 6, 1990, serial number 07/362,901 filed June 7, 1990, all incor- 
porated herein for all purposed by reference. In some embodiment using high-density arrays, high-density oligonucle- 
otide arrays are synthesized using methods such as the Very Large Scale Immobilized Polymer Synthesis (VLSIPS) 
disclosed in U.S. Pat. No. 5,445,934 incorporated herein for ail purposes by reference. Each oligonucleotide occupies 
a known location on a substrate. A nucleic acid target sample is hybridized with a high-density array of oligonucleotides 
and then the amount of target nucleic acids hybridized to each probe in the array is quantified. One preferred quantify- 

15 ing method is to use confocal microscope and fluorescent labels. The GeneChip® system (Affymetrix, Santa Clara, CA) 
is particularly suitable for quantifying the hybridization; however, it will be apparent to those of skill in the art that any 
similar systems or other effectively equivalent detection methods can also be used. 

[0036] High-density arrays are suitable for quantifying a small variations in expression levels of a gene in the pres- 
ence of a large population of heterogeneous nucleic acids. Such high-density arrays can be fabricated either by de novo 

20 synthesis on a substrate or by spotting or transporting nucleic acid sequences onto specific locations of substrate. 
Nucleic acids are purified and/or isolated from biological materials, such as a bacterial plasmid containing a cloned seg- 
ment of sequence of interest. Suitable nucleic acids are also produced by amplification of templates. As a nonlimiting 
illustration, polymerase chain reaction, and/or in vitro transcription, are suitable nucleic acid amplification methods. 
[0037] ' Synthesized oligonucleotide arrays are particularly preferred for this invention. Oligonucleotide arrays have 

25 numerous advantages, as opposed to other methods, such as efficiency of production, reduced intra- and inter array 
variability, increased information content and high signal-to-noise ratio. 

[0038] Preferred high-density arrays for gene function identification and genetic network mapping comprise greater 
than about 100, preferably greater than about 1000, more preferably greater than about 16,000 and most preferably 
greater than 65,000 or 250,000 or even greater than about 1 ,000,000 different oligonucleotide probes, preferably in less 

30 than 1 cm 2 of surface area. The oligonucleotide probes range from about 5 to about 50 or about 500 nucleotides, more 
preferably from about 10 to about 40 nucleotide and most preferably from about 15 to about 40 nucleotides in length. 
[0039] Generally methods of monitoring gene expression involve (a) providing a pool of target nucleic acids com- 
prising RNA transcript(s) of one or more target gene(s), or nucleic acids derived from the RNA transcript(s); (b) hybrid- 
izing the nucleic acid sample to a high-density array of probes and (c) detecting the hybridized nucleic acids and 

35 calculating a relative and/or absolute expression (transcription, RNA processing or degradation) level. 

(A) Providing a Nucl eic Acid Sample 

[0040] One of skill in the art will appreciate that it is desirable to have nucleic samples containing target nucleic acid 
40 sequences that reflect the transcripts of interest. Therefore, suitable nucleic acid samples may contain transcripts of 
interest. Suitable nucleic acid samples, however, may contain nucleic acids derived from the transcripts of interest. As 
used herein, a nucleic acid derived from a transcript refers to a nucleic acid for whose synthesis the mRNA transcript 
or a subsequence thereof has ultimately served as a template. Thus, a cDNA reverse transcribed from a transcript, an 
RNA transcribed from that cDNA, a DNA amplified from the cDNA, an RNA transcribed from the amplified DNA, etc., 
45 are all derived from the transcript and detection of such derived products is indicative of the presence and/or abundance 
of the original transcript in a sample. Thus, suitable samples include, but are not limited to, transcripts of the gene or 
genes, cDNA reverse transcribed from the transcript, cRNA transcribed from the cDNA, DNA amplified from the genes, 
RNA transcribed from amplified DNA, and the like. 

[0041] Transcripts, as used herein, may include, but not limited to pre-mRNA nascent transcript(s), transcript 
so processing intermediates, mature mRNA(s) and degradation products. It is not necessary to monitor all types of tran- 
scripts to practice this invention. For example, one may choose to practice the invention to measure the mature mRNA 
levels only. 

[0042] In one embodiment, such sample is a homogenate of cells or tissues or other biological samples. Preferably, 
such sample is a total RNA preparation of a biological sample. More preferably in some embodiments, such a nucleic 
55 acid sample is the total mRNA isolated from a biological sample. Those of skill in the art will appreciate that the total 
mRNA prepared with most methods includes not only the mature mRNA, but also the RNA processing intermediates 
and nascent pre-mRNA transcripts. For example, total mRNA purified with a poly (dT) column contains RNA molecules 
■ with poly (A) tails. Those polyA + RNA molecules could be mature mRNA, RNA processing intermediates, nascent tran- 
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scripts or degradation intermediates. 

[0043] Biological samples may be of any biological tissue or fluid or cells from any organism. Frequently the sample 
will be a "clinical sample" which is a sample derived from a patient. Clinical samples provide a rich source of information 
regarding the various states of genetic network or gene expression. Some embodiments of the invention are employed 
5 to detect mutations and to identify the phenotype of mutations. Such embodiments have extensive applications in clin- 
ical diagnostics and clinical studies. Typical clinical samples include, but are not limited to, sputum, blood, blood cells 
(e.g., white cells), tissue or fine needle biopsy samples, urine, peritoneal fluid, and pleural fluid, or cells therefrom. Bio- 
logical samples may also include sections of tissues, such as frozen sections or formalin fixed sections taken for histo- 
logical purposes. 

W [0044] Another typical source of biological samples are cell cultures where gene expression states can be manip- 
ulated to explore the relationship among genes. In one aspect of the invention, methods are provided to generate bio- 
logical samples reflecting a wide variety of states of the genetic network. 

[0045] One of skill in the art would appreciate that it is desirable to inhibit or destroy RNase present in homogenates 
before homogenates can be used for hybridization. Methods of inhibiting or destroying nucleases are well known in the 
15 art. In some preferred embodiments, cells or tissues are homogenized in the presence of chaotropic agents to inhibit 
nuclease. In some other embodiments, RNase is inhibited or destroyed by heat treatment followed by proteinase treat- 
ment. 

[0046] Methods of isolating total mRNA are also well known to those of skill in the art. For example, methods of iso- 
lation and purification of nucleic acids are described in detail in Chapter 3 of Laboratory Techniques in Biochemistry and 

20 Molecular Biology: Hybridization With Nucleic Acid Probes, Part I. Theory and Nucleic Acid Preparation, P. Tijssen, ed. 
Elsevier, N.Y (1993) and Chapter 3 of Laboratory Techniques in Biochemistry and Molecular Biology: Hybridization 
With Nucleic Acid Probes, Parti. Theory and Nucleic Acid Preparation, P. Tijssen, ed. Elsevier, N.Y (1993)). 
[0047] In a preferred embodiment, the total RNA is isolated from a given sample using, for example, an acid gua- 
nidinium-phenol-chloroform extraction method and polyA + mRNA is isolated by oligo(dT) column chromatography or by 

25 using (dT) on magnetic beads (see, e.g., Sambrook et a/. f Molecular Cloning: A Laboratory Manual (2nd ed.), Vols. 1 - 
3, Cold Spring Harbor Laboratory, (1 989), or Current Protocols in Molecular Biology, F. Ausubel et al, ed. Greene Pub- 
lishing and Wiley-lnterscience, New York (1987)). 

[0048] Frequently, it is desirable to amplify the nucleic acid sample prior to hybridization. One of skill in the art will 
appreciate that whatever amplification method is used, if a quantitative result is desired, care must be taken to use a 
30 method that maintains or. controls for the relative frequencies of the amplified nucleic acids to achieve quantitative 
amplification. 

[0049] Methods of "quantitative" amplification are well known to those of skill in the art. For example, quantitative 
PCR involves simultaneously co-amplifying a known quantity of a control sequence using the same primers. This pro- 
vides an internal standard that may be used to calibrate the PCR reaction. The high-density array may then include 

35 probes specific to the internal standard for quantification of the amplified nucleic acid. 

[0050] One preferred internal standard is a synthetic AW106 cRNA. The AW106 cRNA is combined with RNA iso- 
lated from the sample according to standard techniques known to those of skilled in the art. The RNA is then reverse 
transcribed using a reverse transcriptase to provide copy DNA. The cDNA sequences are then amplified (e.g., by PCR) 
using labeled primers. The amplification products are separated, typically by electrophoresis, and the amount of radio- 

40 activity (proportional to the amount of amplified product) is determined. The amount of mRNA in the sample is then cal- 
culated by comparison with the signal produced by the known AW106 RNA standard. Detailed protocols for quantitative 
PCR are provided in PCR Protocols, A Guide to Methods and Applications, Innis et al., Academic Press, Inc. N.Y, 
(1990). 

[0051] Other suitable amplification methods include, but are not limited to polymerase chain reaction (PCR) (Innis, 
45 etal., PCR Protocols. A guide to Methods and Application. Academic Press, Inc. San Diego, (1990)), ligase chain reac- 
tion (LCR) (see Wu and Wallace, Genomics, 4: 560 (1989), Landegren, et al., Science, 241 : 1077 (1988) and Barrin- 
ger, etal., Gene, 89: 117 (1990), transcription amplification (Kwoh, et al., Proc. Natl. Acad. Sci. USA, 86: 1 173 (1989)), 
and self-sustained sequence replication (Guatelli, etal., Proc. Nat. Acad. Sci. USA, 87: 1874 (1990)). 
[0052] Cell lysates or tissue homogenates often contain a number of inhibitors of polymerase activity. Therefore, 
so RT-PCR typically incorporates preliminary steps to isolate total RNA or mRNA for subsequent use as an amplification 
template. A one-tube mRNA capture method may be used to prepare poly(A) + RNA samples suitable for immediate RT- 
PCR in the same tube (Boehringer Mannheim). The captured mRNA can be directly subjected to RT-PCR by adding a 
reverse .transcription mix and. subsequently, a PCR mix. 

[0053] In a particularly preferred embodiment, the sample mRNA is reverse transcribed with a reverse transcriptase 
55 and a primer consisting of oligo(dT) and a sequence encoding the phage T7 promoter to provide single stranded DNA 
template. The second DNA strand is polymerized using a DNA polymerase. After synthesis of double-stranded cDNA, 
T7 RNA polymerase is added and RNA is transcribed from the cDNA template. Successive rounds of transcription from 
each single cDNA template results in amplified RNA. Methods of in vitro polymerization are well known to those of skill 
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in the art {see, e.g., Sambrook, supra.) and this particular method is described in detail by Van Gelder, et a/., Proc. 
Natl. Acad. Sci. USA, 87: 1663-1667 (1990) who demonstrate that in vitro amplification according to this method pre- 
serves the relative frequencies of the various RNA transcripts. Moreover, Eberwine et al. Proc. Natl. Acad. Sci. USA, 
89: 3010-3014 provide a protocol that uses two rounds of amplification via in vitro transcription to achieve greater than 
5 10 6 fold amplification of the original starting material, thereby permitting expression monitoring even where biological 
samples are limited. 

(B) Hybridizing nucleic acids to h igh-densitv arrays 
w 1 . Probe design 

[0054] One of skill in the art will appreciate that an enormous number of array designs are suitable for the practice 
of this invention. The high-density array will typically include a number of probes that specifically hybridize to the 
sequences of interest. In addition, in a preferred embodiment, the array will include one or more control probes. 

15 [0055] The high-density array chip includes "test probes." Test probes could be oligonucleotides that range from 
about 5 to about 45 or 5 to about 500 nucleotides, more preferably from about 1 0 to about 40 nucleotides and most pref- 
erably from about 15 to about 40 nucleotides in length. In other particularly preferred embodiments the probes are 20 
or 25 nucleotides in length. In another preferred embodiments, test probes are double or single strand DNA sequences. 
DNA sequences are isolated or cloned from nature sources or amplified from nature sources using nature nucleic acid 

20 as templates. These probes have sequences complementary to particular subsequences of the genes whose expres- 
sion they are designed to detect. Thus, the test probes are capable of specifically hybridizing to the target nucleic acid 
they are to detect. 

[0056] In addition to test probes that bind the target nucleic acid(s) of interest, the high-density array can contain a 
number of control probes. The control probes fall into three categories referred to herein as 1) normalization controls; 

25 2) expression level controls; and 3) mismatch controls. 

[0057] Normalization controls are oligonucleotide or other nucleic acid probes that are complementary to labeled 
reference oligonucleotides or other nucleic acid sequences that are added to the nucleic acid sample. The signals 
obtained from the normalization controls after hybridization provide a control for variations in hybridization conditions, 
label intensity, "reading" efficiency and other factors that may cause the signal of a perfect hybridization to vary between 

30 arrays. In a preferred embodiment, signals {e.g., fluorescence intensity) read from all other probes in the array are 
divided by the signal {e.g., fluorescence intensity) from the control probes thereby normalizing the measurements. 
[0058] Virtually any probe may serve as a normalization control. However, it is recognized that hybridization effi- 
ciency varies with base composition and probe length. Preferred normalization probes are selected to reflect the 
avenge length of the other probes present in the array, however, they can be selected to cover a range of lengths. The 

35 normalization control(s) can also be selected to reflect the (average) base composition of the other probes in the array, 
however in a preferred embodiment, only one or a few normalization probes are used and Jhey are selected such that 
they hybridize well (i.e. no secondary structure) and do not match any target-specific probes. 
[0059] Expression level controls are probes that hybridize specifically with constitutively expressed genes in the 
biological sample. Virtually any constitutively expressed gene provides a suitable target for expression level controls. 

40 Typically expression level control probes have sequences complementary to subsequences of constitutively expressed 
"housekeeping genes" including, but not limited to the p-actin gene, the transferrin receptor gene, the GAPDH gene, 
and the like. 

[0060] Mismatch controls may also be provided for the probes to the target genes, for expression level controls or 
for normalization controls. Mismatch controls are oligonucleotide probes or other nucleic acid probes identical to their 

45 corresponding test or control probes except for the presence of one or more mismatched bases. A mismatched base is 
a base selected so that it is not complementary to the corresponding base in the target sequence to which the probe 
would otherwise specifically hybridize. One or more mismatches are selected such that under appropriate hybridization 
conditions (e.g. stringent conditions) the test or control probe would be expected to hybridize with its target sequence, 
but the mismatch probe would not hybridize (or would hybridize to a significantly lesser extent). Preferred mismatch 

50 probes contain a central mismatch. Thus, for example, where a probe is a 20 mer, a corresponding mismatch probe will 
have the identical sequence except for a single base mismatch (e.g., substituting a G, a C or a T for an A) at any of 
positions 6 through 14 (the central mismatch). 

[0061] Mismatch probes thus provide a control for nonrspecific.binding or cross-hybridization to a nucleic acid in the 
sample other than the target to which the probe is directed. Mismatch probes thus indicate whether a hybridization is 
55 specific or not. For example, if the target is present the perfect match probes should be consistently brighter than the 
mismatch probes. In addition, if all central mismatches are present, the mismatch probes can be used to detect a muta- 
tion. The difference in intensity between the perfect match and the mismatch probe (l(PM)-l(MM)) provides a good 
measure of the concentration of the hybridized material. 
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[0062] The high-density array may also include sample preparation/amplification control probes. These are probes 
that are complementary to subsequences of control genes selected because they do not normally occur in the nucleic 
acids of the particular biological sample are assayed. Suitable sample preparation/amplification control probes include, 
for example, probes to bacterial genes (e.g., Bio B) where the sample in question is a biological from a eukaryote. 
[0063] The RNA sample is then spiked with a known amount of the nucleic acid to which the sample prepara- 
tion/amplification control probe is directed before processing. Quantification of the hybridization of the sample prepara- 
tion/amplification control probe then provides a measure of alteration in the abundance of the nucleic acids caused by 
processing steps {e.g. PCR, reverse transcription, in vitro transcription, etc.). 

[0064] In a preferred embodiment, oligonucleotide probes in the high-density array are selected to bind specifically 
to the nucleic acid target to which they are directed with minimal non-specific binding or cross-hybridization under the 
particular hybridization conditions utilized. Because the high-density arrays of this invention can contain in excess of 
1,000,000 different probes, it is possible to provide every probe of a characteristic length that binds to a particular 
nucleic acid sequence. 

[0065] In addition, in a preferred embodiment, expression monitoring arrays are used to identify the presence and 
expression (transcription) level of genes which are several hundred base pairs long. For most applications it would be 
useful to identify the presence, absence, or expression level of several thousand to one hundred thousand genes. 
Because the number of oligonucleotides per array is limited in a preferred embodiment, it is desired to include only a 
limited set of probes specific to each gene whose expression is to be detected. 

[0066] As disclosed in U.S. Application Ser. No. 08/772,376, probes as short as 15, 20, or 25 nucleotide are suffi- 
cient to hybridize to a subsequence of a gene and that, for most genes, there is a set of probes that performs well across 
a wide range of target nucleic acid concentrations. In a preferred embodiment, it is desirable to choose a preferred or 
"optimum" subset of probes for each gene before synthesizing the high-density array. 



2. Hybridization 

[0067] Nucleic acid hybridization simply involves contacting a probe and target nucleic acid under conditions where 
the probe and its complementary target can form stable hybrid duplexes through complementary base pairing. The 
nucleic acids that do not form hybrid duplexes are then washed away leaving the hybridized nucleic acids to be 
detected, typically through detection of an attached detectable label. It is generally recognized that nucleic acids are 
denatured by increasing the temperature or decreasing the salt concentration of the buffer containing the nucleic acids. 
Under low stringency conditions (e.g., low temperature and/or high salt) hybrid duplexes (e.g., DNA:DNA, RNA:RNA, 
or RNA:DNA) will form even where the annealed sequences are not perfectly complementary. Thus specificity of hybrid- 
ization is reduced at lower stringency. Conversely, at higher stringency (e.g., higher temperature or lower salt) success- 
ful hybridization requires fewer mismatches. 

[0068] One of skill in the art will appreciate that hybridization conditions may be selected to provide any degree of. 
stringency. In a preferred embodiment, hybridization is performed at low stringency in this case in 6X SSPE-T at 37 C 
(0.005% Triton X-100) to ensure hybridization and then subsequent washes are performed at higher stringency (e.g., 1 
X SSPE-T at 37 C) to eliminate mismatched hybrid duplexes. Successive washes may be performed at increasingly 
higher stringency (e.g., down to as low as 0.25 X SSPE-T at 37 C to 50 C) until a desired level of hybridization specificity 
is obtained. In another preferred embodiment 1 X MES buffer is used. The buffer comprises 0.1 M 2-[N-Morpho- 
linio]ethanesulfonic acid, 1.0 M NaCI, and 0.01 % Triton-X 100™. Stringency can also be increased by addition of 
agents such as formamide. Hybridization specificity may be evaluated by comparison of hybridization to the test probes 
with hybridization to the various controls that can be present (e.g., expression level control, normalization control, mis- 
match controls, etc.). 

[0069] In general, there is a tradeoff between hybridization specificity (stringency) and signal intensity. Thus, in a 
preferred embodiment, the wash is performed at the highest stringency that produces consistent results and that pro- 
vides a signal intensity greater than approximately 10% of the background intensity. Thus, in a preferred embodiment, 
the hybridized array may be washed at successively higher stringency solutions and read between each wash. Analysis 
of the data sets thus produced will reveal a wash stringency above which the hybridization pattern is not appreciably 
altered and which provides adequate signal for the particular oligonucleotide probes of interest. 
[0070] ' The stability of duplexes formed between RNAs or DNAs are generally in the order of RNA:RNA > RNA:DNA 
> DNA:DNA, in solution. Long probes have better duplex stability with a target, but poorer mismatch discrimination than 
shorter probes (mismatch discrimination refers to the measured hybridization signal ratio between a perfect match 
probe and a single base mismatch probe). Shorter probes (e.g., 8-mers) discriminate mismatches very well, but the 
overall duplex stability is low. 

[0071 ] Altering the thermal stability (T m ) of the duplex formed between the target and the probe using, e.g., known 
oligonucleotide analogues allows for optimization of duplex stability and mismatch discrimination. One useful aspect of 
altering the T m arises from the fact that adenine-thymine (A-T) duplexes have a lower T m than guanine-cytosine (G-C) 
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duplexes, due in part to the fact that the A-T duplexes have 2 hydrogen bonds per base-pair, while the G-C duplexes 
have 3 hydrogen bonds per base pair. In heterogeneous oligonucleotide arrays in which there is a non-uniform distribu- 
tion of bases, it is not generally possible to optimize hybridization for each oligonucleotide probe simultaneously. Thus, 
in some embodiments, it is desirable to selectively destabilize G-C duplexes and/or to increase the stability of A-T 
duplexes. This can be accomplished, e.g. by substituting guanine residues in the probes of an array which form G-C 
duplexes with hypoxanthine, or by substituting adenine residues in probes which form A-T duplexes with 2,6 diaminop- 
urine or by using the salt tetramethyl ammonium chloride (TMACI) in place of NaCI. 

[0072] Altered duplex stability conferred by using oligonucleotide analogue probes can be ascertained by following, 
e.g., fluorescence signal intensity of oligonucleotide analogue arrays hybridized with a target oligonucleotide over time. 
The data allow optimization of specific hybridization conditions at, e.g., room temperature (for simplified diagnostic 
applications in the future). 

[0073] Another way of verifying altered duplex stability is by following the signal intensity generated upon hybridiza- 
tion with time. Previous experiments using DNA targets and DNA chips have shown that signal intensity increases with 
time, and that the more stable duplexes generate higher signal intensities faster than less stable duplexes. The signals 
reach a plateau or "saturate" after a certain amount of time due to all of the binding sites becoming occupied. These 
data allow for optimization of hybridization, and determination of the best conditions at a specified temperature. 
[0074] Methods of optimizing hybridization conditions are well known to those of skill in the art (see, e.g., Labora- 
tory Techniques in Biochemistry and Moiecuiar Biology, Vol. 24: Hybridization With Nucleic Acid Probes, P. Tijssen, ed. 
Elsevier, N.Y., (1993)). 

(C) Signal Detection 

[0075] In a preferred embodiment, the hybridized nucleic acids are detected by detecting one or more labels 
attached to the sample nucleic acids. The labels may be incorporated by any of a number of means well known to those 
25 of skill in the art. However, in a preferred embodiment, the label is simultaneously incorporated during the amplification 
step in the preparation of the sample nucleic acids. Thus, for example, polymerase chain reaction (PCR) with labeled 
primers or labeled nucleotides will provide a labeled amplification product. In a preferred embodiment, transcription 
amplification, as described above, using a labeled nucleotide (e.g. fluorescein-labeled UTP and/or CTP) incorporates 
a label into the transcribed nucleic acids. 
30 [0076] Alternatively, a label may be added directly to the original nucleic acid sample (e.g., mRNA, polyA mRNA, . 
cDNA, etc.) or to the amplification product after the amplification is completed. Means of attaching labels to nucleic 
acids are well known to those of skill in the art and include, for example nick translation or end-labeling (e.g. with a 
labeled RNA) by kinasing of the nucleic acid and subsequent attachment (ligation) of a nucleic acid linker joining the 
sample nucleic acid to a label (e.g., a fluorophore). 
■35 [0077] Detectable labels suitable for use in the present invention include any composition detectable by spectro- 
scopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means. Useful labels in the 
present invention include biotin for staining with labeled streptavidin conjugate, magnetic beads (e.g., Dynabeads™), 
fluorescent dyes (e.g., fluorescein, texas red, rhodamine, green fluorescent protein, and the like), radiolabels (e.g., 3 H, 
12 5I, 35 S, 1 4C, or 3 2P), enzymes (e.g., horse radish peroxidase, alkaline phosphatase and others commonly used in an 
40 ELISA), and colorimetric labels such as colloidal gold or colored glass or plastic (e.g., polystyrene, polypropylene, latex, 
- etc.) beads. Patents teaching the use of such labels include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 
3,996,345; 4,277,437; 4,275,149; and 4,366,241. 

[0078] Means of detecting such labels are well known to those of skill in the art. Thus, for example, radiolabels may 
be detected using photographic film or scintillation counters, fluorescent markers may be detected using a photodetec- 
' 45 tor to detect emitted light. Enzymatic labels are typically detected by providing the enzyme with a substrate and detect- 
ing the reaction product produced by the action of the enzyme on the substrate, and colorimetric labels are detected by 
simply visualizing the colored label. One particular preferred methods uses colloidal gold label that can be detected by 
measuring scattered light. 

[0079] The label may be added to the target (sample) nucleic acid(s) prior to, or after the hybridization. So called 
so "direct labels" are detectable labels that are directly attached to or incorporated into the target (sample) nucleic acid 

prior to hybridization. In contrast, so called "indirect labels" are joined to the hybrid duplex after hybridization. Often; the 

indirect label is attached to a binding moiety that has been attached to the target nucleic acid prior to the hybridization. 

Thus, for example, the target nucleic acid may be biotinylated before the hybridization. After hybridization, an avidin- 

conjugated fluorophore will bind the biotin bearing hybrid duplexes providing a label that is easily detected. For a 
55 detailed review of methods of labeling nucleic acids and detecting labeled hybridized nucleic acids see Laboratory 

Techniques in Biochemistry and Molecular Biology Vol. 24: Hybridization With Nucleic Acid Probes, P. Tijssen, ed. 

Elsevier, N.Y., (1993)). 

[0080] Fluorescent labels are preferred and easily added during an in vitro transcription reaction. In a preferred 
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embodiment, fluorescein labeled UTP and CTP are incorporated into the RNA produced in an in vitro transcription reac- 
tion as described above. 

[0081] Means of detecting labeled target (sample) nucleic acids hybridized to the probes of the high-density array 
are known to those of skill in the art. Thus, for example, where a colorimetric label is used, simple visualization of the 
5 label is sufficient. Where a radioactive labeled probe is used, detection of the radiation (e.g. with photographic film or a 
solid state detector) is sufficient. 

[0082] In a preferred embodiment, however, the target nucleic acids are labeled with a fluorescent label and the 
localization of the label on the probe array is accomplished with fluorescent microscopy. The hybridized array is excited 
with a light source at the excitation wavelength of the particular fluorescent label and the resulting fluorescence at the 
w emission wavelength is detected. In a particularly preferred embodimeht, the excitation light source is a laser appropri- 
ate for the excitation of the fluorescent label. 

[0083] The confocal microscope may be automated with a computer-controlled stage to automatically scan the 
entire high-density array. Similarly, the microscope may be equipped with a phototransducer {e.g., a photomultiplier, a 
solid state array, a CCD camera, etc.) attached to an automated data acquisition system to automatically record the flu- 

15 orescence signal produced by hybridization to each oligonucleotide probe on the array. Such automated systems are 
described at length in U.S. Patent No: 5,143,854, PCT Application 20 92/10092, and copending U.S. Application Ser. 
No. 08/195,889 filed on February 10, 1994. Use of laser illumination in conjunction with automated confocal microscopy 
for signal detection permits detection at a resolution of better than about 1 00 ^m, more preferably better than about 50 
^im, and most preferably better than about 25 |xm. 

20 [0084] One of skill in the art will appreciate that methods for evaluating the hybridization results vary with the nature 
of the specific probe nucleic acids used as well as the controls provided. In the simplest embodiment, simple quantifi- 
cation of the fluorescence intensity for each probe is determined. This is accomplished simply by measuring probe sig- 
nal strength at each location (representing a different probe) on the high-density array (e.g., where the label is a 
fluorescent label, detection of the amount of florescence (intensity) produced by a fixed excitation illumination at each 

25 location on the array). Comparison of the absolute intensities of an array hybridized to nucleic acids from a "test" sample 
with intensities produced by a "control" sample provides a measure of the relative expression of the nucleic acids that 
hybridize to each of the probes. 

[0085] One of skill in the art, however, will appreciate that hybridization signals will vary in strength with efficiency 
of hybridization, the amount of label on the sample nucleic acid and the amount of the particular nucleic acid in the sam- 
30 pie. Typically nucleic acids present at very low levels (e.g., < 1pM) will show a very weak signal. At some low level of 
concentration, the signal becomes virtually indistinguishable from background. In evaluating the hybridization data, a 
threshold intensity value may be selected below which a signal is not counted as are essentially indistinguishable from 
background. 

[0086] Where it is desirable to detect nucleic acids expressed at lower levels, a lower threshold is chosen. Con- 
35 versely, where only high expression levels are to be evaluated a higher threshold level is selected. In a preferred embod- 
iment, a suitable threshold is about 10% above that of the average background signal. 

[0087] In addition, the provision of appropriate controls permits a more detailed analysis that controls for variations 
in hybridization conditions, cell health, non-specific binding and the like. Thus, for example, in a preferred embodiment, 
the hybridization array is provided with normalization controls. These normalization controls are probes complementary 

40 to control sequences added in a known concentration to the sample. Where the overall hybridization conditions are 
poor, the normalization controls will show a smaller signal reflecting reduced hybridization. Conversely, where hybridi- . 
zation conditions are good, the normalization controls will provide a higher signal reflecting the improved hybridization. 
Normalization of the signal derived from other probes in the array to the normalization controls thus provides a control 
for variations in hybridization conditions. Typically, normalization is accomplished by dividing the measured signal from 

45 the other probes in the array by the average signal produced by the normalization controls. Normalization may also 
include correction for variations due to sample preparation and amplification. Such normalization may be accomplished 
by dividing the measured signal by the average signal from the sample preparation/amplification control probes (e.g., 
the Bio B probes). The resulting values may be multiplied by a constant value to scale the results. 
[0088] As indicated above, the high-density array can include mismatch controls. In a preferred embodiment, there 

so is a mismatch control having a central mismatch for every probe (except the normalization controls) in the array. It is 

" " expected that after washing in stringent conditions, where a perfect match would be expected to hybridize to the probe, 
but not to the mismatch, the signal from the mismatch controls should only reflect non-specific binding or the presence 
in the sample of a nucleic acid that hybridizes with the mismatch. Where both the probe in question and its correspond- 
ing mismatch control both show high signals, or the mismatch shows a higher signal than its corresponding test probe, 

55 there is a problem with the hybridization and the signal from those probes is ignored. The difference in hybridization sig- 
nal intensity between the target specific probe and its corresponding mismatch control is a measure of the discrimina- 
tion of the target-specific probe. Thus, in a preferred embodiment, the signal of the mismatch probe is subtracted from 
the signal from its corresponding test probe to provide a measure of the signal due to specific binding of the test probe. 
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[0089] The concentration of a particular sequence can then be determined by measuring the signal intensity of 
each of the probes that bind specifically to that gene and normalizing to the normalization controls. Where the signal 
from the probes is greater than the mismatch, the mismatch is subtracted. Where the mismatch intensity is equal to or 
greater than its corresponding test probe, the signal is ignored.. The expression level of a particular gene can then be 

5 scored by the number of positive signals (either absolute or above a threshold value), the intensity of the positive signals 
(either absolute or above a selected threshold value), or a combination of both metrics {e.g., a weighted average). 
[0090] In some preferred embodiments, a computer system is used to compare the hybridization intensities of the 
perfect match and mismatch probes of each pair. If the gene is expressed, the hybridization intensity (or affinity) of a 
perfect match probe of a pair should be recognizably higher than the corresponding mismatch probe. Generally, if the 

10 hybridizations intensities of a pair of probes are substantially the same, it may indicate the gene is not expressed. How- 
ever, the determination is not based on a single pair of probes, the determination of whether a gene is expressed is 
based on an analysis of many pairs of probes. 

[0091 ] After the system compares the hybridization intensity of the perfect match and mismatch probes, the system 
indicates expression of the gene. As an example, the system may indicate to a user that the gene is either present 
15 (expressed), marginal or absent (unexpressed). Specific procedures for data analysis is disclosed in U.S. Application 
08/772,376, previously incorporated for all purposes. 

[0092] In addition to high-density nucleic acid arrays, other methods are also useful for massive gene expression 
monitoring. Differential display, described by Liang, R and Pardee, A.B. (Differential Display of eukaryotic messenger 
RNA by means of the polymerase chain reaction. Science 257:967-971, 1992, incorporated herein by reference for all 

20 purposes) provides a useful mean for distinguishing gene expression between two samples. Serial analysis of gene 
expression, described by Velculescu et al. (Serial Analysis of Gene Expression. Science, 270:484-487, 1995, incorpo- 
rated herein by reference for all purposes) provides another method for quantitative and qualitative analysis of gene 
expression. Optical fiber oligonucleotide sensors, described by Ferguson et al. (A Fiber-optic DNA biosensor microar- 
ray for the analysis of gene expression. Nature-Biotechnology 14:1681-1684, 1996), can also be used for gene expres- 

25 sion monitoring. 

[0093] It is understood that the examples and embodiments described herein are for illustrative purposes only and 
that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be 
included within the spirit and purview of this application and scope of the appended claims. All publications, patents, 
and patent applications cited herein are hereby incorporated by reference for all purposes. 
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Examples 

[0094] To address the functional properties of WT1 and EWS-WT1 and examine their effects on endogenous target 
genes, we developed cells in which a tightly regulated inducible promoter allowed expression of the full length protein. 

35 Forced expression of WT1 and EWS-WT1 in these cells triggered apoptosis. Oligonucleotide array-based expression 
profiling revealed a small number of genes exhibiting altered expression following WT1 and EWS-WT1 induction. 
[0095] To study the functional properties of WT1 and EWS-WT1 , we established inducible, tetracyline-regulated 
expression (Gossen and Bujard, 1992) of WT1 and EWS-WT1 in U20S osteosarcoma cells. We searched for endog- 
enous genes whose expression levels might be altered following inducible expression of WT1 and EWS-WT1. Messen- 

40 ger RNA was isolated from cells at 12 hrs following WT1 and EWS-WT1 induction, biotinylated, and hybridized to high- 
density oligonucleotide arrays representing 6,800 known transcripts and expressed sequence tags (Lockhart et al.,, 
1996; Wodicka et al., 1997). A gene was considered a candidate WT1 and EWS-WT1 target if it was reproducibly 
induced or repressed in at least two different experimental replicates following WT1 and EWS-WT1 induction, and it 
exhibited an expression changes of at least 3-fold. 

45 [0096] In this so-called "tet-off induction system , the recombinant gene is induced by withdrawal of tetracycline 
from the tissue culture medium. The purpose of these studies was to identify candidate functional mediators (CFMs) of 
these tumor suppressors. Upon induction of WT1 or EWS-WT1 expression, we identified 28 and 1 7 endogenous genes, 
respectively (of 7000 genes monitored), that displayed reproducible expression changes of 3-fold or greater. 
[0097] By generating cell lines with tightly regulated inducible expression of WT1 and EWS-WT1 , we have identified 

50 downstream pathways that are likely to contribute to its function as a tumor suppressor. 

Hybridization-Based Assay for Generating Expression Profiles 

[0098] Messenger RNA levels are determined by hybridization of complete mRNA populations to sets of arrays 
55 containing hundreds of thousands of chemically synthesized oligonucleotides. The oligonucleotides are synthesized in 
situ on glass supports using light-directed, solid-phase combinatorial chemistry. Because the arrays are designed and 
synthesized based on sequence information alone, they provide a direct link between genomic sequence and measure- 
ments of differential gene expression. The arrays measure 1 .28 x 1.28 cm and contain more than 65,000 50 x 50 micron 
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synthesis features. Each synthesis feature consists of more than 10 7 copies of a particular 25-mer oligonucleotide. 
[0099] For each mRNA sample, the expression levels of 6,800 full-length human genes will be monitored. The full 
set of oligonucleotides covering these genes is divided over 4 different arrays, comprising a total area of approximately 
one square inch. For each gene, approximately 20 complementary 25-mers are chosen based on automated selection 
criteria. The criteria include tests for sequence uniqueness relative to the rest of the genome and the absence of 
sequence features (e.g., self-complementarity or clusters of single nucleotides) that have been determined to adversely 
affect hybridization behavior on arrays. The use of sets of oligonucleotides for each gene provides redundancy in the 
detection and analysis of the data, mitigates the potentially confounding effects of occasional cross-hybridization, and 
makes it so all oligonucleotides do not have to hybridize identically in order to obtain quantitative information. To further 
increase the sensitivity and specificity of detection, each complementary oligonucleotide (perfect match, or PM) is syn- 
thesized with a closely related mismatch (MM) partner in a physically adjacent position. The mismatch partner is iden- 
tical except for a single base difference at the central position of the 25-mer. The MM oligonucleotide of each pair serves 
as an internal control that allows consistent hybridization patterns (patterns of PM signals that are larger than the cor- 
responding MM signals) to be recognized. Quantitative image analysis is based on the average of the differences 
between the PM and MM partners, so that nonspecific and background contributions tend to cancel, while specific 
hybridization signals tend to add constructively across the set of oligonucleotide pairs for each gene. These hybridiza- 
tion signals are quantitative over three orders of magnitude, from 1 :300,000 to 1 :300. 
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Additional target genes induced by WT1(-KTS). 


Accession number 


Description 


Fold induction by Chip 


X65779 


Acidic FGF 


22.1 


X17405 


Cartilage linking protein- 1 


46.1 


X58377 


IL-11 


18.8 
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TABLE 1 



r 



E22BI 

Hilar 
Egg 

BE£B 



iCandidataWTi Targets " 

NATURAL KILLER CELLS PROTEIN 4 PRECURSOR (HUMAN);contatoa atement MSR1 repetitive element; 



Human Mate birring prote^ (FBF) mRNA, 3' and. 



. -Urnan HALPHA44 gwwfar aJpfaMubufin, exon« 1-3. 
Human mRNA for heat shock protein HSP70B'» 
Ksapkns mRNA IbfQOK product 



HEAT SHOCK 70 KQ PROTBN 1 (HUMAN);. 
Human amphlregutin (AR) mRNA, 



PROCOLLAGEN ALPHA 1(1) CHAW 



cds, donas lambda-AR1 an d tembda-A R2. 



Human beta-migrEting plasminogen activator Enhfoftw I mRflA, 3* end 
human radtatad KaraCnocyta mRNA tor cysteine proteose tnhflHtoc 
Ksaplena mRNA tor cavaogt ^ 



Human brain natriuretic protem (BNP) cane, complete cds. 



Human leukemia virus nacsptor 1 (GLVR1) mRNA, complete cds. 



KERATIN, TYPE I CYTOSKELETAL 17 (HUMAN);, 



TUBULIN ALPHAS CHAIN (GatiUS galiusfr 
PURINE kudjEOSIQE PHQSPHORYLASE (HUMAN). 



Human Gem GTPase (gem) mRNA complete cds. 



Human mRNA for adrenomedullin, complete cde. 

UROKINASE PLASMINOGEN ACTIVATOR SURFACE RECEPTOR GPhANCHORED (HUMAN)T 



Human mRNA for tissue type plasminogen activator. 



Human gravfn (a cytoplasmic antigen recognized by myasthenia gravis sera) mRNA. 3' end. 



Human Jun-B mRNA for JUN-B protein, 
Htaplfif mRNA for etangatiM factor 1 alpha-2. 
HOMEOT1C GENE REGULATOR (Dmsophfta motanog aster) 
Kta plena mRNA f done 8112), 



Human Interferon gamma (reetment Indudbto mRNA 



Ksaptens mRNA for transmenbrana receptor proton. 



DCHONPRIAL PHOSPHATE CARRIER PROTBN PRECURSOR (HUMAN);, 



TABLE 3 



EWS" 



R60906 



M2S062 



PROBABLE NUCLEAR ANTIGEN (Paaudorabiea virus) 
INTBRLEUKJN-2 RECEPTOR BETA CHAIN PRECURSOR 



H2C529 



Q1/S£P£CiRC CYCUN D1 (Homo sapiens) 



T79475 



PROBABLE NUCLEAR ANTIGEN (PaeudoraMsa virus) 



M22349 



Human nsuron-spectflc gamma-2 anojaag, compjata cds, 

Homo sapiens g$SXq2S.l;p34) fusion pane NPM-MLF1 mRNA comptete cds. 



L430S4 



H82468 



U23852 



COLLAGEN ALPHA 3(VI) CHAIN (GaMus oattua^ 



Human T-Jvmphocyte spadflo protein tyrosine kinase p56lck (Ick) ebberant mRNA. complete cds. 



U209S2 



Human Instiin4k» grov^ ta^ binding orotetn-4 (KSFBP4) gene, promoter and oompfeta oda. 



L1851Q 



Homo ai 



In B mRNA complete cds. 



T51534 



CYSTAT1N C PRECURSOR 



(HUMAN). 

DUAL SPECIFICITY MTrOQEN-ACTTVATED PROTEIN XINASE KINASE 2 (Homo sapiens)"" 
NEURONAL-GUAL CELL ADHESION MOLECULE PRECURSOR (QaBus galtus) 
PitA-i frROTOONCOGENE SERINE/THREONINE-PROTSIN KINASE (Mus musculus) 



R42291 



mm 



|rS43SO |6AR^VLESTERASE PRECURSOR (Mus musculus)"" 



PROTEIN-TYROSINE PHOSPHATASE PA01 (HUMAN);. 
INTERFfeRON-lNDUCEP PROTEIN 6M6 PRECURSOR (HUMANfccontalrts L1 repetitive element 
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Claims 

1 . A method for detecting a WT1 gene functional mutation in target cells comprising the steps of: 

detecting expression of at least three down-stream genes of WT1 in a sample of (a) target cells, and (b) refer- 
ence cells having a wild-type WT1 gene, the reference cells are otherwise substantially similar to the target 
cells, the down-stream genes are up- or down-regulated by the wild-type WT1 gene; and 
comparing the expression of the down-stream genes in the target cells and the reference cells, wherein a dif- 
ference in the expression between the target cells and reference cells suggests a WT1 functional mutation in 
the target cells. 

2. The method of claim 1, wherein said down-stream genes are transcriptionally regulated by said wild-type WT1 
gene and the expression of said down-stream genes is detected by measuring amounts of transcripts of said down- 
stream genes in said reference and target cells. 

3. The method of claim 1 , wherein said amounts of transcripts are measured with high-density nucleic acid array. 

4. The method of claim 1 , wherein said down-stream genes comprise at least one WT1 up-regulated gene selected 
from the group consisting of: natural killer cells protein 4 precursor (M59807), folate binding protein (M25317), 
HALPHA44 gene for alpha-tubulin (X06956), heat shock protein HSP70B (X51758), 90K product (H17969), heat 
shock 70kd protein 1 (T66307), amphiregulin (M30704), procollagen alpha 1 (T51558), beta-migrating plasmino- 
gen activator inhibitor 1 (M14083), cysteine protease inhibitor from radiated keratinocytes (X05978), brain natriu- 
retic protein (BNP) (M31766), leukemia virus receptor 1 (GLVR1) (L20859), type 1 cytoskeletal 17 keratin 
(R71870), tubulin alpha-5 chain (H45051), purine nucleoside phosphorylase (T47964), Gem GTPase (U10550), 
adrenomedullin (D14874), GPI-anchored urokinase plasminogen activator surface receptor (R38636), tissue type 
plasminogen activator (X13097), gravin (M96322), jun-B(X51345), elongation factor 1 alpha-2 (X79490), homeotic 
gene regulator (R16977), clone 9112 (X57348), interferon gamma treatment inducible mRNA (M26683), trans- 
membrane receptor protein (Z17227), acidic FGF (X65779), cartilage linking protein-1(X17405), IL-11 (X58377) 
and mitochondrial phosphate carrier protein (R49231). 

5. The method of claim 1 wherein said down-stream genes comprise at least one WT1 down-regulated gene selected 
from the group consisting of caveolin and elongation factor 1 alpha-2. 

6. The method of claim 4 wherein said down-stream genes further comprise p21 or EGFR. 

7. The method of claim 1 further comprising the step of: 

indicating a loss of function mutation in the WT1 gene in the target cells if the expression of said WT1 up-reg- 
ulated genes is at least two times less in said target cells than in said reference cells or if the expression of said 
WT1 down-regulated genes is at least two times more in said target cells than in said reference cells. 

8. A method for detecting a EWS-WT1 gene fusion in target cells comprising the steps of: 

detecting expression of one or more down-stream genes of an EWS-WT1 fusion in a sample of (a) target cells, 
and (b) reference cells having a wild-type EWS and WT1 gene, said reference cells being otherwise substan- 
tially similar to said target cells, said down-stream genes being up-or down-regulated by said wild-type EWS 
and WT1 gene; and 

comparing the expression of said down-stream genes in said target ceils and said reference cells, wherein a 
difference in said expression between the target cells and reference cells suggests a EWS-WT1 fusion in the 
target cells. 

9. The method of claim 8, wherein said down-stream genes comprise at least one EWS-WT1 up-regulated gene 
selected -from the group consisting of: Pseudorabies virus nuclear antigen .(R60906 and T79475)), beta chain of 
interleukin-2 receptor (M26062), G1/S-specific cyclin D1 (H20529), neuron-specific gamma-2 enolase (M22349), 
fusion gene NPM-MLF1 resulting from translocation t(3;5)(q25.1;p34) (L49054), collagen alpha 3(IV) chain 
(H62466), T-lymphocyte specific protein tyrosine kinase p561ck(1ck) (U23852), insulin-like growth factor binding 
protein-4 (U20982), cathepsin B (L16510), cystatin C (T51534), dual specificity mitogen-activated protein kinase 

' kinase 2 (R42291), neuronal-glial cell adhesion molecule (T53118), PIM-1 proto-oncogene serine/threonine-pro- 
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tein kinase (H10925), carboxylesterase precursor (R54359), protein-tyrosine phosphatase PAC-1 (L11329), and 
interferon-induced protein 6-16 (X02492). 

10. An in-cell functional assay for a WT1 sequence alteration comprising the steps of: 

detecting expression of at least three down-stream genes in a target sample from target cells having a WT1 
sequence alteration and in a reference sample from reference cells having a wild-type WT1 gene, said refer- 
ence cells being otherwise substantially similar to said target cells, said down-stream genes being up- or down- 
regulated by said wild-type WT1 gene; and 

comparing said expression in said target sample to said expression in said reference sample, wherein a differ- 
ence in the expression between said two samples suggests that said WT1 sequence alteration affects the bio- 
logical function of WT1 . 

11. The method of claim 10, wherein said down-stream genes are transcriptionally regulated by said wild-type WT1 
gene and the expression of said down-stream genes js detected by measuring amounts of transcripts of said down- 
stream genes in said reference and target cells. 

12. The method of claim 1 1 , wherein said amounts of transcripts are measured with a high-density nucleic acid array. 

13. The method of claim 10, wherein said down-stream genes comprise at least one WT1 up-regulated gene selected 
from the group consisting of: natural killer cells protein 4 precursor (M59807), folate binding protein (M25317), 
HALPHA44 gene for alpha-tubulin (X06956), heat shock protein HSP70B (X51758), 90K product (H 17969), heat 
shock 70kd protein 1 (T66307), amphiregulin (M30704), procollagen alpha 1 (T51558), beta-migrating plasmino- 
gen activator inhibitor 1 (M 14083), cysteine protease inhibitor from radiated keratinocytes (X05978), brain natriu- 
retic protein (BNP) (M31766), leukemia virus receptor 1 (GLVR1) (L20859), type 1 cytoskeletal 17 keratin, tubulin 
alpha-5 chain (H45051), purine nucleoside phosphorylase (T47964), Gem GTPase (U10550), adrenomedullin 
(D14874), GPI-anchored urokinase plasminogen activator surface receptor (R38636), tissue type plasminogen 
activator (X13097), gravin (M96322), jun-B (X51345), homeotic gene regulator (R16977), clone 9112 (X57348), 
interferon gamma treatment inducible mRNA (M26683), transmembrane receptor protein (Z17227), acidic FGF 
(X65779), cartilage linking protein-1(X17405), IL-11 (X58377) and mitochondrial phosphate carrier protein 
(R49231). 

1 4. The method of claim 1 0 wherein said down-steam genes comprise at least one WT1 down-regulated gene selected 
from the group consisting of caveolin and elongation factor 1 alpha-2 (X79490). 

15. The method of claim 13 wherein said down-steam genes further comprise p21 or EGFR. 

16. The method of claim 10 further comprising the steps of 

indicating that said WT1 sequence alteration is a loss of wild-type function mutation if the expression of said 
WT1 up regulated genes is at least two times less in said target cells than in said reference cells or if the 
expression of said WT1 down regulated genes is at least two times more in said target cells than in said refer- 
ence cells. 

17. A method for detecting a mutation in a target WT1 gene using a computer comprising: 

inputting into a computer wild-type expression data of at least three down-stream genes in a wild-type sample 
containing a wild-type WT1 gene, said down-stream genes being transcriptionally regulated by said wild-type 
WT1 gene; 

inputting into a computer target expression data of said plurality of down-stream genes in a target sample con- 
taining said target WT1 gene; 

comparing the target and wild-type expression data to determine differences, wherein differences suggest a 
mutation in ssid target WT1 gene. 

18. The method of claim 17, wherein said down-stream genes comprise at least one WT1 up-regulated gene selected 
from the group consisting of: natural killer cells protein 4 precursor (M59807), folate binding protein (M25317), 
HALPHA44 gene for alpha-tubulin (X06956), heat shock protein HSP70B (X51758), 90K product (H 17969), heat 
shock 70kd protein 1 (T66307), amphiregulin (M30704), procollagen alpha 1 (T51558), beta-migrating plasmino- 
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gen activator inhibitor 1 (M14083), cysteine protease inhibitor from radiated keratinocytes (X05978), brain natriu- 
retic protein (BNP) (M31766), leukemia virus receptor 1 (GLVR1) (L20859), type 1 cytoskeletal 17 keratin, tubulin 
alpha-5 chain (H45051), purine nucleoside phosphorylase (T47964), Gem GTPase (U10550), adrenomedullin 
(D14874), GPI-anchored urokinase plasminogen activator surface receptor (R38636), tissue type plasminogen 
5 activator (X13097), gravin (M96322), jun-B (X51345), homeotic gene regulator (R16977), clone 9112 (X57348), 

interferon gamma treatment inducible mRNA.(M26683), transmembrane receptor protein (Z17227), acidic FGF 
(X65779), cartilage linking protein-1(X17405), IL-11 (X58377) and mitochondrial phosphate carrier protein 
H(R49231). 

" ' " w 19. The method of claim 17 wherein said down-stream genes comprise at least one WT1 down-regulated gene 
selected from the group consisting of caveolin and elongation factor 1 alpha-2 (X79490). 

20. The method of claim 18 wherein said down-stream genes further comprise p21 or EGFR. 

15 21. The method of claim 17 further comprising the step of: 

indicating a mutation in said target WT1 gene if the expression of said WT1 up-regulated genes is at least two 
times less in said target cells than in said reference cells or if the expression of said WT1 down-regulated 
genes is at least two times more in said target cells than in said reference cells. 

20 

22. A method for detecting a translocation fusing EWS and WT1 genes using a computer comprising: 

inputting into a computer wild-type expression data of a plurality of down-stream genes in a wild-type sample 
containing wild-type EWS and WT1 genes, said down-stream genes being transcriptionally regulated by a 
25 EWS-WT1 fusion protein; 

inputting into a computer target expression data of said plurality of down-stream genes in a target sample 
being tested for the presence of a EWS-WT1 fusion protein; 

comparing the target and wild-type expression data to determine differences, wherein differences suggest a 
translocation fusing the EWS and WT1 genes. 

23. The method of claim 22, wherein said down-stream genes comprise at least one EWS-WT1 up-regulated gene 
selected from the group consisting of: Pseudorabies virus nuclear antigen (R60906 and T79475)), beta chain of 
interleukin-2 receptor (M26062), G1/S-specific cyclin D1 (H20529), neuron-specific gamma-2 enolase (M22349), 
fusion gene NPM-MLF1 resulting from translocation t(3;5)(q25.1;p34) (L49054), collagen alpha 3(IV) chain 
(H62466), T-lymphocyte specific protein tyrosine kinase p561ck(1ck) (U23852), insulin-like growth factor binding 
protein-4 (U20982), cathespsin B, cystatin C (T51534), dual specificity mitogen-activated protein kinase kinase 2 
(R42291), neuronal-glial cell adhesion molecule (T53118), PIM-1 proto-oncogene serine/threonine-protein kinase 
(H10925)! carboxylesterase precursor (R54359), protein-tyrosine phosphatase PAC-1 (L11329), and interferon- 
induced protein 6-16 (X02492). 

24. A method of diagnosing neoplasia of a test cell comprising: 

hybridizing a transcription indicator of a test cell to a set of nucleic acid probes, wherein the transcription indi- 
cator is selected from the group consisting of mRNA, cDNA and cRNA, wherein the set of nucleic acid probes 
comprises a plurality of nucleic acid molecules each of which is a portion of a gene which is activated by or 
repressed by WT1 selected from the group consisting of: natural killer cells protein 4 precursor (M59807), heat 
shock protein HSP70B (X51758), 90K product (H17969), heat shock 70kd protein 1 (T66307), procollagen 
alpha 1 (T51558), cysteine protease inhibitor from radiated keratinocytes (X05978), brain natriuretic protein 
(BNP) (M31766), leukemia virus receptor 1 (GLVR1) (L20859), type 1 cytoskeletal 17 keratin (R71870), purine 
nucleoside phosphorylase (T47964), adrenomedullin (D14874), gravin (M96322), jun-B (X51345), elongation 
factor 1 alpha-2 (X79490), homeotic gene regulator (R16977), clone 91 12 (X57348), interferon gamma treat- 
ment inducible mRNA (M26683), transmembrane receptor protein (Z17227), acidic FGF (X65779), cartilage 
linking protein-1(X17405), IL-11. (X58377) . and mitochondrial phosphate carrier protein (R49231); 
detecting amounts of transcription indicator which hybridize to each of said set of nucleic acid probes; 
identifying a test cell as neoplastic if (1) hybridization of the transcription indicator of the test cell to a probe 
which is a WT1 -activated gene is lower than hybridization using a transcription indicator from a normal cell, or 
(2) hybridization of the transcription indicator of the test cell to a probe which a WT1 -repressed gene is higher 
than hybridization using a transcription indicator from a normal cell. 
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25. The method of claim 24 wherein the test cell is a kidney cell. 

26. The method of claim 24 wherein at least 4 of said probes comprise portions of genes which are WT1 -activated or 
WT1 -repressed. 

27. The method of claim 24 wherein at least 10 of said probes comprise portions of genes which are WT1 -activated or 
WT1 -repressed. 

28. The method of claim 24 wherein at least 15 of said probes comprise portions of genes which are WT1 -activated or 
WT1 -repressed. 

29. The method of claim 24 wherein the nucleic acid probes are attached to a solid support. 

30. The method of claim 24 wherein the nucleic acid probes are arranged in an array. 

31. The method of claim 30 wherein the array comprises nucleic acid probes which are portions of at least 250 different 
genes. 

32. The method of claim 30 wherein the array comprises nucleic acid probes which are portions of at least 6000 differ- 
ent genes. 

33. The method of claim 24 further comprising the step of: 

determining the sequence of WT1 genes in the test cell to determine the WT1 genotypic status of the cell. 

34. The method of claim 24 wherein a test ceil is identified as neoplastic if hybridization is at least 2-fold different 
between compared samples. 

35. A method of identifying anti-cancer drugs, comprising the step of: 

contacting a test compound with a human cell; 

determining the expression of the cell of at least one WT1-up regulated gene or at least one WT1-down regu- 
lated gene, wherein the WT1 up-regulated genes are selected from the group consisting of: natural killer cells 
protein 4 precursor (M59807), heat shock protein HSP70B (X51758), 90K product (H17969), heat shock 70kd 
protein 1 (T66307), procollagen alpha 1 (T51558), cysteine protease inhibitor from radiated keratinocytes 
(X05978), brain natriuretic protein (BNP) (M31766), leukemia virus receptor 1 (GLVR1) (L20859), type 1 
cytoskeletal 17 keratin, purine nucleoside phosphorylase (T47964), adrenomedullin (D14874), gravin 
(M96322), jun-B (X51345), homeotic gene regulator (R16977), clone 9112 (X57348), interferon gamma treat- 
ment inducible mRNA (M26683), transmembrane receptor protein (Z17227), acidic FGF (X65779), cartilage 
linking protein-1(X17405), IL-11 (X58377) and mitochondrial phosphate carrier protein (R49231), wherein the 
WT1 -down regulated gene is elongation factor 1 alpha-2 (X70940); and 

identifying a test compound as a potential anti-cancer drug if it increases expression of at least one WT1 up- 
regulated gene or decreases expression of a WT1 down-regulated gene in the human cell. 

36. The method of claim 35 wherein the cell is a kidney cancer cell. 

37. The method of claim 35 wherein the cell is a tumor cell. 

38. The method of claim 35 wherein the cell contains no wild-type WT1 genes. 

39. The method of claim 35, the expression of said WT1 up-regulated and down-regulated genes is detected by meas- 
uring amounts of transcripts of said genes in said cell. 

40. The method of claim 39, wherein said amounts of transcripts are measured using a high-density nucleic acid array. 

41. A method of diagnosing neoplasia of a test cell comprising: 

hybridizing a transcription indicator of a test cell to a set of nucleic acid probes, wherein the transcription indi- 
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cator is selected from the group consisting of mRNA, cDNA and cRNA, wherein the set of nucleic acid probes 
comprises a plurality of nucleic acid molecules each of which is a portion of a gene which is activated by or 
repressed by a EWS-WT1 fusion protein; 

detecting amounts of transcription indicator which hybridize to each of said set of nucleic acid probes; 
5 identifying a test cell as neoplastic if (1) hybridization of the transcription indicator of the test cell to a probe 

which is a EWS-WT1 -activated gene is lower than hybridization using a transcription indicator from a normal 
cell, or (2) hybridization of the transcription indicator of the test cell to a probe which a EWS-WT1 -repressed 
gene is higher than hybridization using a transcription indicator from a normal cell. 

w 42. The method of claim 41 wherein the test cell is a Ewihg sarcoma cell. 

43. The method of claim 41 wherein at least 4 of said probes comprise portions of genes which are EWS-WT1 -acti- 
vated or EWS-WT1-represse>d. 

15 44. The method of claim 41 wherein at least 10 of said probes comprise portions of genes which are EWS-WT1 -acti- 
vated or EWS-WT1 -repressed. 

45. The method of claim 41 wherein at least 15 of said probes comprise portions of genes which are EWS-WT1 -acti- 
vated or EWS-WT1 -repressed. 

20 

46. The method of claim 41 wherein the nucleic acid probes are attached to a solid support. 

47. The method of claim 41 wherein the nucleic acid probes are arranged in an array. 

25 48. The method of claim 47 wherein the array comprises nucleic acid probes which are portions of at least 250 different 
genes. 

49. The method of claim 47 wherein the array comprises nucleic acid probes which are portions of at least 6000 differ- 
ent genes. 

30 

50. The method of claim 41 wherein at least one of the nucleic acid probes comprises a portion of a EWS-WT1 up- 
regulated gene selected from the group consisting of: Pseudorabies virus nuclear antigen (R60906 and T79475)), 
beta chain of interleukin-2 receptor (M26062), G1/S-specific cyclin D1 (H20529), neuron-specific gamma-2 eno- 
lase (M22349), fusion gene NPM-MLF1 resulting from translocation t(3;5)(q25.1;p34) (L49054), collagen alpha 

35 3(IV) chain (H62466), T-lymphocyte specific protein tyrosine kinase p56lck(lck) (U23852), insulin-like growth factor 
binding protein-4 (U20982), cathespsin B, cystatin C (T51534), dual specificity mitogen-activated protein kinase 
kinase 2 (R42291), neuronal-glial cell adhesion molecule (T53118), PIM-1 proto-oncogene serine/threonine-pro- 
. tein kinase (H10925), carboxylesterase precursor (R54359), protein-tyrosine phosphatase PAC-1 (L11329), and 
interferon-induced protein 6-1 6 (X02492). 

40 

* 51. The method of claim 41 wherein a test cell is identified as neoplastic if hybridization is at least 2-fold different 
between compared samples. 

52. The method of claim 41 wherein a test cell is identified as neoplastic if hybridization is at least 3-fold different 
45 between compared samples. 

53. The method of claim 41 wherein a test cell is identified as neoplastic if hybridization is at least 5-fold different 
between compared samples. 

so 54. A method of identifying anti-cancer drugs, comprising the steps of: 

contacting a test compound with a human cell; 

determining expression of at least one EWS-WT1 up-regulated genes or at least one EWS-WT1 down-regu- 
lated gene in the human cell; 

55 identifying a test compounds an anti-cancer drug if it reduced expression of an EWS-WT1 up-regulated gene 

or increases expression of an EWS-WT1 down-regulated gene. 



55. The method of claim 54 wherein the cell is a kidney cancer cell. 



EP 1 036 845 A2 



56. The method of claim 54 wherein the cell is a tumor cell. 

57. The method of claim 54, the expression of said WT1 up-regulated and down-reguiated genes is detected by meas- 
uring amounts of transcripts of said genes in said cell. 

5 

58. The method of claim 57, wherein said amounts of transcripts are measured using a high-density nucleic acid array. 

59. The method of claim 54, wherein said WT1 up-regulated gene is selected from the group consisting of Pseudora- 
bies virus nuclear antigen (R60906 and T79475)), beta chain of interieukin-2 receptor (M26062), G1/S-specific 

w cycliri D1 (H20529), neuron-specific gamma-2 enolase (M22349), fusion gene NPM-MLF1 resulting from translo- 
cation t(3;5)(q25.1;p34) (L49054), collagen alpha 3(IV) chain (H62466), T-lymphocyte specific protein tyrosine 
kinase p56lck(lck) (U23852), insulin-like growth factor binding protein-4 (U20982), cathespsin B, cystatin C 
(T51534), dual specificity mitogen-activated protein kinase kinase 2 (R42291), neuronal-glial cell adhesion mole- 
cule (T53118), PIM-1 proto-oncogene serine/threonine-protein kinase (H10925), carboxylesterase precursor 

15 (R54359), protein-tyrosine phosphatase PAC-1 (L1 1329), and interferon-induced protein 6-16 (X02492). 

60. A method for detecting a WT1 gene functional mutation in target cells comprising the steps of: 

detecting expression of at least one down-stream gene of WT1 in a sample of (a) target cells, and (b) reference 
20 cells having a wild-type WT1 gene, said reference cells being otherwise substantially similar to said target 

cells, said down-stream gene being selected from the group consisting of: natural killer cells protein 4 precur- 
sor (M59807), folate binding protein (M25317), HALPHA44 gene for alpha-tubulin (X06956), heat shock pro- 
tein HSP70B (X51758), 90K product (H17969), heat shock 70kd protein 1 (T66307), amphiregulin (M30704), 
procollagen alpha 1 (T51558), beta-migrating plasminogen activator inhibitor 1 (M14083), cysteine protease 
25 inhibitor from radiated keratinocytes (X05978), brain natriuretic protein (BNP) (M31766), leukemia virus recep- 

tor 1 (GLVR1) (L20859), type 1 cytoskeletal 17 keratin, tubulin alpha-5 chain (H45051), purine nucleoside 
phosphorylase (T47964), Gem GTPase (U10550), adrenomedullin (D14874), GPI-anchored urokinase plas- 
minogen activator surface receptor (R38636), tissue type plasminogen activator (X13097), gravin (M96322), 
jun-B (X51345), elongation factor 1 alpha-2 (X79490), homeotic gene regulator (R16977), clone 9112 
30 (X57348), interferon gamma treatment inducible mRNA (M26683), transmembrane receptor protein (Z1 7227), 

acidic FGF (X65779), cartilage linking protein-1 (X1 7405), IL-1 1 (X58377) mitochondrial phosphate carrier pro- 
tein (R49231), and caveolin; and 

comparing the expression of said at least one down-stream gene in said target cells and said reference cells, 
wherein a difference in said expression between the target cells and reference cells suggests a WT1 functional 
35 mutation in the target cells. ■ • • 

61. An in-cell functional assay for a WT1 sequence alteration comprising the steps of: 

detecting expression of at least one down-stream gene selected from the group consisting of: natural killer cells 
40 protein 4 precursor (M59807), folate binding protein (M25317), HALPHA44 gene for alpha-tubulin (X06956), 

heat shock protein HSP70B (X51758)* 90K product (H 17969), heat shock 70kd protein 1 (T66307), amphireg- 
ulin (M30704), procollagen alpha 1 (T51558), beta-migrating plasminogen activator inhibitor 1 (M14083), 
cysteine protease inhibitor from radiated keratinocytes (X05978), brain natriuretic protein (BNP) (M31766), 
leukemia virus receptor 1 (GLVR1) (L20859), type 1 cytoskeletal 17 keratin, tubulin alpha-5 chain (H45051), 
45 purine nucleoside phosphorylase (T47964), Gem GTPase (U10550), adrenomedullin (D14874), GPI-anchored 

urokinase plasminogen activator surface receptor (R38636), tissue type plasminogen activator (X13097), 
gravin (M96322), jun-B (X51345), elongation factor 1 alpha-2 (X79490), homeotic gene regulator (R16977), 
clone 9112 (X57348), interferon gamma treatment inducible mRNA (M26683), transmembrane receptor pro- 
tein (Z17227), caveolin , acidic FGF (X65779), cartilage linking protein-1 (X1 7405), IL-1 1 (X58377) and mito- 
50 chondrial phosphate carrier protein (R49231) in a target sample from target cells having a WT1 sequence 

alteration and in a reference sample from reference cells having a wild-type WT1 gene, said reference cells 
being otherwise substantially similar to said target cells, said down-stream genes being up- or down-regulated 
by said wild-type WT1 gene; and 

comparing said expression in said target sample to said expression in said reference sample, wherein a differ- 
55 ence in the expression between said two samples suggests that said WT1 sequence alteration affects the bio- 

logical function of WT1 . 

62. A method for detecting a mutation in a target WT1 gene using a computer comprising: 
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inputting into a computer wiid-type expression data of at least one WT1 -down-stream gene in a wild-type sam- 
ple containing a wild-type WT1 gene, wherein said down-stream gene is selected from the group consisting of: 
natural killer cells protein 4 precursor (M59807), folate binding protein (M25317), HALPHA44 gene for alpha- 
tubulin (X06956), heat shock protein HSP70B (X51758), 90K product (H17969), heat shock 70kd protein 1 
(T66307), amphiregulin (M30704), procollagen alpha 1 (T51558), beta-migrating plasminogen activator inhib- 
itor 1 (M14083), cysteine protease inhibitor from radiated keratinocytes (X05978), brain natriuretic protein 
(BNP) (M31766), leukemia virus receptor 1 (GLVR1) (L20859), type 1 cytoskeletai 17 keratin, tubulin alpha-5 
chain (H45051), purine nucleoside phosphorylase (T47964), Gem GTPase (U10550), adrenomedullin 
(D14874), GPI-anchored urokinase plasminogen activator surface receptor (R38636), tissue type plasminogen 
activator (X13097), gravin (M96322), jun-B (X51345), elongation factor 1 alpha-2 (X79490), homeotic gene 
regulator (R16977), clone 91 12 (X57348), interferon gamma treatment inducible mRNA (M26683), transmem- 
brane receptor protein (Z17227), caveolin, acidic FGF (X65779), cartilage linking protein-1(X17405), IL-11 
(X58377) and mitochondrial phosphate carrier protein (R49231); and 

inputting into a computer target expression data of said plurality of down-stream genes in a target sample con- 
taining said target WT1 gene; 

comparing the target and wild-type expression data to determine differences, wherein differences suggest a 
mutation in said target WT1 gene. 

63. A method of diagnosing neoplasia of a test cell comprising: 

determining expression levels in a test cell and in a control cell of at least one gene which is activated by or 
repressed by WT1 selected from the group consisting of: natural killer cells protein 4 precursor (M59807), heat 
shock protein HSP70B (X51758), 90K product (H17969), heat shock 70kd protein 1 (T66307), procollagen 
alpha 1 (T51558), cysteine protease inhibitor from radiated keratinocytes (X05978), brain natriuretic protein 
(BNP) (M31766), leukemia virus receptor 1 (GLVR1) (L20859), type 1 cytoskeletai 17 keratin (R71870), purine 
nucleoside phosphorylase (T47964), adrenomedullin (D14874), gravin (M96322), jun-B (X51345), elongation 
factor 1 a!pha-2 (X79490), homeotic gene regulator (R16977), clone 91 12 (X57348), interferon gamma treat- 
ment inducible mRNA (M26683), transmembrane receptor protein (Z17227), acidic FGF (X65779), cartilage 
linking protein-1(X17405), IL-1 1 (X58377) and mitochondrial phosphate carrier protein (R49231); 
comparing the determined expression levels in the test cell to the determined expression levels in the control 
cell; 

identifying the test cell as neoplastic if (1) the level of expression of at least one of said genes which is a WT1- 
activated gene is lower in the test cell than in the control cell, or (2) the level of expression of at least one of 
said genes which is WT1 -repressed is higher in the test cell than in the control cell. 

64. A method of diagnosing neoplasia of a test cell comprising: 

determining expression levels in a test cell and in a control cell of at least one gene which is activated by or 
repressed by a EWS-WT1 fusion protein; 

comparing the determined expression levels in the test cell to the determined expression levels in the control 
cell; 

identifying a test cell as neoplastic if (1) expression in the test cell of a EWS-WT1 -activated gene is lower than 
expression in a normal cell, or (2) expression in the test cell of a EWS-WT1 -repressed gene is higher than 
expression in the normal cell. 

65. A method of diagnosing neoplasia of a test cell comprising: 

determining expression levels in a test cell and in a control cell of at least three genes which are activated by 
or repressed by WT1 ; 

comparing the determined expression levels in the test cell to the determined expression levels in the control 
cell; 

identifying a test cell as neoplastic if (1) expression in the test cell of a WT1 -activated gene is lower than 
expression in a normal ceil, or (2) expression in the test cell of a WT1 -repressed gene is higher than expression 
in the normal cell. 



